saveRDS(brfss_restrict, file = "path/brfss_restrict.rds")
Data Lab 10 - Medicaid and Health Pt.1
In the last Data Lab, we examined how Louisiana’s Medicaid expansion impacted insurance coverage across various demographic groups. Now, we’ll investigate whether Medicaid expansion improved healthcare access for Louisianans relative to residents in other Gulf South states.
Additionally, we’ll complete the data cleaning and preparation work that we’ll need for the quasi-experimental research design we’re going to use in Data Lab 11.
Ultimately, the goals of our policy analysis will be:
- Estimate the effect of Louisiana’s Medicaid expansion on insurance coverage.
- Estimate the effect of Louisiana’s Medicaid expansion on health care access.
- Estimate the effect of Louisiana’s Medicaid expansion on health outcomes.
Note: We can do everything we want to do in this Data Lab with code we’ve used in previous Data Labs. I don’t want you to use ChatGPT while working through the following exercises. Instead, I’d like you to think about what you’ve done in the previous Data Labs and how to apply those techniques to what we’re about to do here. If you ask ChatGPT for help, it will suggest commands that we haven’t used in this class and if those commands end up in your submission, you won’t receive the participation credit for this class.
Step 1: Create a New R Markdown Document for this Data Lab
Create a new R Markdown document and give it a YAML header that includes the title “HPAM 7660 Data Lab 10”, your name, the date, and “pdf_document” as the output format. You’ll submit a pdf of this R Markdown document once you’ve finished the Data Lab today.
Step 2: Load and Prepare the Data
I’ve created a data file for us to use in this Data Lab. The data file is called acs_datalab10.rds
and you can download it here.
This data comes from the Behavioral Risk Factor Surveillance Survey (BRFSS). We’ve used this data in previous Data Labs, but I’ve added few more variables here. Here are the variables included in the acs_datalab10.rds
file and their values:
- YEAR: survey year; 2012 through 2019.
- STATEFIP: state FIPS code.
- GENHLTH: self-rated health; 1 = excellent, 2 = very good, 3 = good, 4 = fair, 5 = poor.
- PHYSHLTH: Number of days in the past month that a respondent’s physical health was “not good”.
- MENTHLTH: Number of days in the past month that a respondent’s mental health was “not good”.
- HLTHPLN1: does the respondent have health insurance coverage; 1 = yes, 2 = no.
- PERSDOC2: does the respondent have a usual source of care; 1 = yes, 0 = no.
- MEDCOST: did the respondent forgo medical care in past year due to cost; 1 = yes, 0 = no.
- AGE: respondent age in years.
- RACE: respondent race/ethnicity; 1 = non-Hispanic white, 2 = non-Hispanic Black, 3 = non-Hispanic other race, 4 = non-Hispanic multiracial, 5 = Hispanic.
- EDUCA: respondent educational attainment; 1 = no school, 2 = grades 1 through 8, 3 = grades 9 through 11, 4 = high school graduate, 5 = some college, 6 = college graduate.
- INCOME: respondent’s household income; 1 = less than $10k, 2 = $10k to $15k, 3 = $15k to $20k, 4 = $20k to $25k, 5 = $25k to $35k, 6 = $35k to $50k, 7 = $50k to $75k, 8 = $75k or more.
- SEX: respondent sex; 1 = male, 2 = female.
Using this file, construct a “restricted” data set with the following characteristics:
- Include only respondents who live in the following states: Alabama, Florida, Georgia, Louisiana, Mississippi, and Texas.
- Include only respondents between the ages of 26 and 64.
- Include only respondents with household incomes below $35k.
- Create a new variable called “UNINSURED” that is equal to 1 if a respondent is uninsured and equal to 0 if the respondent is insured.
- Create a new variable called “PF_HLTH” that is equal to 1 if a respondent’s self-rated health is “poor” or “fair” and equal to 0 otherwise.
- Create a variable called “TREATMENT” that is equal to 1 if a respondent lives in Louisiana and is equal to 0 otherwise.
We’re going to use this restricted dataset in our next Data Lab, so let’s go ahead and save it using the following command:
Be sure to replace “path” with the path to the folder on your computer where you’d like to save the data. Also, note that I’m using a Mac, so I have a forward slash in my code. If you’re using a PC, you’ll probably need to use a double backslash.
We’re also going to need to use our restricted ACS dataset that we created in Data Lab 9 in our next Data Lab. So, if you haven’t done so already, you’ll want to recreate that dataset and save it to your computer. You don’t need to do this today, but you should do it before our next Data Lab.
Step 3: Plot Trends in Insurance Coverage
Like last time, we’re going to plot trends in insurance coverage rates for Louisiana and for the other Gulf South states. However, our plot is going to look a little different this time. Instead of plotting trends for Louisiana and the Gulf South states in separate figures, let’s plot them both on the same figure. We can do this by using the “TREATMENT” variable that we defined in the previous step.
First, you’ll need to calculate mean uninsurance rates by year and by treatment status. When doing so, you’ll need to account for the fact that some respondents might have missing values for “UNINSURED” (as well as for other variables). This means you’ll want to include the na.rm = TRUE
statement when calculating mean values.
Once you’ve done that, you can use the following code to plot uninsurance rates over time for Louisiana and the Gulf South States:
ggplot(df, aes(x = YEAR)) +
geom_line(data = subset(df, TREATMENT == 1),
aes(y = MEAN_UNINSURED, color = "Louisiana")) +
geom_point(data = subset(df, TREATMENT == 1),
aes(y = MEAN_UNINSURED, color = "Louisiana")) +
geom_line(data = subset(df, TREATMENT == 0),
aes(y = MEAN_UNINSURED, color = "Gulf South")) +
geom_point(data = subset(df, TREATMENT == 0),
aes(y = MEAN_UNINSURED, color = "Gulf South")) +
geom_vline(xintercept = 2016, linetype = "dotted") +
labs(title = "Insurance Coverage in Louisiana & Gulf South States",
x = "Year",
y = "Share Uninsured",
color = "") +
ylim(0, 0.6) +
theme_minimal()
Note that you’ll need to replace “df” in the code above with whatever you named the data frame that you’re trying to plot.
- Describe the pattern you see in the figure.
- Is the pattern of uninsurance rates in the BRFSS data similar to the pattern you observed in the ACS data from Data Lab 9?
Step 4: Plot Trends in Usual Source of Care
Now let’s look to see if changes in insurance coverage in Louisiana translated into improvements in health care access.
Modify the code you used in Step 3 to plot trends in whether respondents reported having a usual source of care in Louisiana and for the other Gulf South states.
- Describe the pattern you see in the figure. Does Medicaid expansion in Louisiana appear to be associated with changes in reporting a usual source of care?
Step 5: Plot Trends in Forgoing Care Due to Cost
Lastly, plot trends in whether respondents reported forgoing care at any time over the past year due to cost in Louisiana and for the other Gulf South states.
- Describe the pattern you see in the figure. Does Medicaid expansion in Louisiana appear to be associated with changes in forgoing medical care due to cost?
Step 6: Knitting to PDF
Once you’ve finished answering the questions, knit your R Markdown document to a PDF and upload the PDF here. Your document should include all of the tables and figures you created in this Data Lab along with your answers to the questions.
Key Takeaways
We’ve now explored changes in the uninsurance rate associated with Medicaid expansion using two datasets (ACS & BRFSS) and changes in Medicaid coverage rates using the ACS data. It seems pretty clear that Medicaid expansion in Louisiana increased Medicaid coverage and reduced the uninsurance rate in the state. In the next Data Lab, we’ll quantify this change.
We also examined how changes in insurance coverage associated with Medicaid expansion might translate into changes in access to care. Improved access to health care services represents a logical pathway through which Medicaid expansion might improve health.
In our next (and final!) Data Lab, we’ll explore the connection between Medicaid coverage and health by exploiting Louisiana’s Medicaid expansion as a “natural experiment”. To do so, we’ll use a quasi-experimental reserach design known as “difference-in-differences” that will compare changes in Louisiana to changes in the other Gulf South states that have not yet expanded Medicaid coverage.